Search CORE

21 research outputs found

On the Distance of Databases

Author: G. Burosch
H. Boutselakis
H.M. Berman
J. Demetrovics
J.B. Kruskal
K. Rother
P. Erdős
T.N. Bhat
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2010
Field of study

Crossref

Repository of the Academy's Library

Deriving a mutation index of carcinogenicity using protein structure and protein interfaces

Author: A Custodio
A David
A Dixit
A Hamosh
A Pal
AJ Bass
Anna Tramontano
B Reva
B Vogelstein
CJ Richardson
CM Croce
D Chasman
D Sims
D Talavera
D Xu
E Krissinel
EC Chao
ER Mardis
F Damm
Frances Pearl
G Birrane
G De Baets
H Boutselakis
H Carter
H Makishima
IA Adzhubei
IS Moreira
J Carlsson
Jarle Hakas
JM Hurst
JM Izarzugaza
JR Morris
K Wang
Konstantinos Mitsopoulos
L Breiman
L Ding
M Li
M Magrane
Marketa Zvelebil
MR Stratton
MR Stratton
MS Greenblatt
MW MacArthur
MY Frederic
Octavio Espinosa
P Flicek
P Kumar
P Srivastava
PA Chan
PA Futreal
PB Crowley
PC Ng
PC Ng
PD Stenson
PH Lee
PT Wan
PV Hornbeck
PY Chou
R Ferla
R Rajasekaran
RJ Kinsella
S Jones
S Sunyaev
S Velankar
SA Forbes
TM Anne
V Ramensky
W Huang da
W Kabsch
X Wang
X Wang
Y Bromberg
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2014
Field of study

With the advent of Next Generation Sequencing the identification of mutations in the genomes of healthy and diseased tissues has become commonplace. While much progress has been made to elucidate the aetiology of disease processes in cancer, the contributions to disease that many individual mutations make remain to be characterised and their downstream consequences on cancer phenotypes remain to be understood. Missense mutations commonly occur in cancers and their consequences remain challenging to predict. However, this knowledge is becoming more vital, for both assessing disease progression and for stratifying drug treatment regimes. Coupled with structural data, comprehensive genomic databases of mutations such as the 1000 Genomes project and COSMIC give an opportunity to investigate general principles of how cancer mutations disrupt proteins and their interactions at the molecular and network level. We describe a comprehensive comparison of cancer and neutral missense mutations; by combining features derived from structural and interface properties we have developed a carcinogenicity predictor, InCa (Index of Carcinogenicity). Upon comparison with other methods, we observe that InCa can predict mutations that might not be detected by other methods. We also discuss general limitations shared by all predictors that attempt to predict driver mutations and discuss how this could impact high-throughput predictions. A web interface to a server implementation is publicly available at http://inca.icr.ac.uk/

CiteSeerX

Public Library of Science (PLOS)

Crossref

Directory of Open Access Journals

PubMed Central

Institute of Cancer Research Repository

Sussex Research Online

FigShare

MSDmotif: exploring protein sites and motifs

Author: A Golovin
A Golovin
A Prilc
A Prlic
Adel Golovin
AG Murzin
AJ Shepherd
AV Efimov
AV Efimov
BL Sibanda
C Bystroff
CA Orengo
CG Hunter
CH Wu
CT Porter
D Schomburg
DCP Kuhn
DI Stuart
DJ Craik
EJ Milner-White
EJ Milner-White
EJ Milner-White
ELL Sonnhammer
ELL Sonnhammer
H Boutselakis
H Kaur
H Kawasaki
HM Berman
ID Kuntz
J Lee
JD Watson
JD Watson
JYL Questel
KB Li
Kim Henrick
M Clamp
MJ Hartshorn
MR Nelson
N Hulo
ND Rawlings
RD Dowell
RD Finn
S Hayward
S Zhirong
SF Altschul
SF Altschul
T Hubbard
TJ Oldfield
TL Bailey
WJ Duddy
WR Pearson
Publication venue: BioMed Central
Publication date: 01/01/2008
Field of study

Abstract Background Protein structures have conserved features – motifs, which have a sufficient influence on the protein function. These motifs can be found in sequence as well as in 3D space. Understanding of these fragments is essential for 3D structure prediction, modelling and drug-design. The Protein Data Bank (PDB) is the source of this information however present search tools have limited 3D options to integrate protein sequence with its 3D structure. Results We describe here a web application for querying the PDB for ligands, binding sites, small 3D structural and sequence motifs and the underlying database. Novel algorithms for chemical fragments, 3D motifs, ϕ/ψ sequences, super-secondary structure motifs and for small 3D structural motif associations searches are incorporated. The interface provides functionality for visualization, search criteria creation, sequence and 3D multiple alignment options. MSDmotif is an integrated system where a results page is also a search form. A set of motif statistics is available for analysis. This set includes molecule and motif binding statistics, distribution of motif sequences, occurrence of an amino-acid within a motif, correlation of amino-acids side-chain charges within a motif and Ramachandran plots for each residue. The binding statistics are presented in association with properties that include a ligand fragment library. Access is also provided through the distributed Annotation System (DAS) protocol. An additional entry point facilitates XML requests with XML responses. Conclusion MSDmotif is unique by combining chemical, sequence and 3D data in a single search engine with a range of search and visualisation options. It provides multiple views of data found in the PDB archive for exploring protein structures.</p

Crossref

Springer - Publisher Connector

Directory of Open Access Journals

PubMed Central

PDBe: Protein Data Bank in Europe

Author: A. Golovin
A. Pajon
A. Suarez-Uruena
A. W. Sousa Da Silva
B. Beuth
Bernstein
C. Best
C. H. Boutselakis
C. J. Penkett
D. Dimitropoulos
Doreleijers
Dowell
E. B. Krissinel
Ebbert
Fogh
Fogh
G. J. Kleywegt
G. Sahni
G. van Ginkel
Golovin
Henrick
Holm
J. Pineda-Castillo
J. Swaminathan
K. Henrick
KELLEY
Kelley
Kelley
Kolodny
Kouranov
Le Questel
Lin
Lipman
Lukasik
M. Hirshberg
M. John
Milner-White
Milner-White
N. Cobley
R. Newman
R. Slowley
S. Sen
S. Velankar
Siddiqui
Swedlow
T. Oldfield
Tagari
Tagari
Velankar
Vranken
Vranken
W. F. Vranken
Watson
Yona
Publication venue: Oxford University Press
Publication date
Field of study

The Protein Data Bank in Europe (PDBe) (http://www.ebi.ac.uk/pdbe/) is actively working with its Worldwide Protein Data Bank partners to enhance the quality and consistency of the international archive of bio-macromolecular structure data, the Protein Data Bank (PDB). PDBe also works closely with its collaborators at the European Bioinformatics Institute and the scientific community around the world to enhance its databases and services by adding curated and actively maintained derived data to the existing structural data in the PDB. We have developed a new database infrastructure based on the remediated PDB archive data and a specially designed database for storing information on interactions between proteins and bound molecules. The group has developed new services that allow users to carry out simple textual queries or more complex 3D structure-based queries. The newly designed ‘PDBeView Atlas pages’ provide an overview of an individual PDB entry in a user-friendly layout and serve as a starting point to further explore the information available in the PDBe database. PDBe’s active involvement with the X-ray crystallography, Nuclear Magnetic Resonance spectroscopy and cryo-Electron Microscopy communities have resulted in improved tools for structure deposition and analysis

Crossref

PubMed Central

From protein sequences to 3D-structures and beyond: the example of the UniProt Knowledgebase

Author: A Andreeva
A Ben-Shem
A Chatr-aryamontri
A Cuff
A Gavin
A Hamosh
A Juncker
A Matte
AD Moore
AE Todd
AK Dunker
B Boeckmann
B Wollscheid
BH Dessailly
C Alfarano
C Bru
C Chothia
C Dodge
C Sala
C Yeats
CB Anfinsen
CH Wu
D Barrell
D Wilson
DA Benson
DH Haft
DH Shin
E Jain
E Zito
EA Bruford
EL Ulrich
F Chiti
F Kiefer
G Cochrane
G Lopez
G Zanotti
GE Tusnády
H Berman
H Boutselakis
H Mi
H Yu
HM Berman
I Letunic
I Xenarios
J Piatigorsky
J Rual
J Tamames
J White
JD Watson
JJW Wiltzius
JL Markley
JS Garavelli
K Degtyarenko
KD Pruitt
KD Pruitt
M Bauer
M Grabowski
M Hendlich
M Mueller
M Punta
M Revington
M Sickmeier
MA Hadders
ME Cusick
MI Ivanova
MJ Fogg
ML Benson
MR Sawaya
N Dephoure
N Farriol-Mathis
N Hulo
N Simonis
NJ Mulder
O Gileadi
OC Redfern
P Flicek
PG Bagos
R Gerber
R Nair
R Nelson
R Olson
R Rentzsch
RA Laskowski
RD Finn
S Addou
S Braconi Quintaje
S Dutta
S Hiller
S Hunter
S Kerrien
S Orchard
S Topiol
SB Long
SE Antonarakis
SI O’Donoghue
SJ Wodak
SM Johnson
ST Sherry
SW Cowan-Jacob
T Köcher
T Lima
The Uni Prot Consortium
U Pieper
Ursula Hinz
Y Jiang
Y Wang
YL Yip
Publication venue: SP Birkhäuser Verlag Basel
Publication date: 01/01/2009
Field of study

With the dramatic increase in the volume of experimental results in every domain of life sciences, assembling pertinent data and combining information from different fields has become a challenge. Information is dispersed over numerous specialized databases and is presented in many different formats. Rapid access to experiment-based information about well-characterized proteins helps predict the function of uncharacterized proteins identified by large-scale sequencing. In this context, universal knowledgebases play essential roles in providing access to data from complementary types of experiments and serving as hubs with cross-references to many specialized databases. This review outlines how the value of experimental data is optimized by combining high-quality protein sequences with complementary experimental results, including information derived from protein 3D-structures, using as an example the UniProt knowledgebase (UniProtKB) and the tools and links provided on its website (http://www.uniprot.org/). It also evokes precautions that are necessary for successful predictions and extrapolations

Crossref

Springer - Publisher Connector

Serveur académique lausannois

PubMed Central

UCL Discovery

E-MSD: the European Bioinformatics Institute Macromolecular Structure Database

Author: H. Boutselakis
Publication venue: 'Oxford University Press (OUP)'
Publication date
Field of study

Crossref

On the distance of databases

Author: G. Burosch
H. Boutselakis
H.M. Berman
J. Demetrovics
J.B. Kruskal
K. Rother
P. Erdős
T.N. Bhat
Publication venue: 'Springer Fachmedien Wiesbaden GmbH'
Publication date: 01/01/2010
Field of study

Crossref

SZTAKI Publication Repository

Repository of the Academy's Library

E-MSD: the European Bioinformatics Institute Macromolecular Structure Database

Author: Boutselakis H.
Copeland J.
Dimitropoulos D.
Fillon J.
Golovin A.
Henrick K.
Hussain A.
Ionides J.
John M.
Keller P. A.
Krissinel E.
McNeil P.
Naim A.
Newman R.
Oldfield T.
Pineda J.
Rachedi A.
Sitnov A.
Sobhany S.
Suarez-Uruena A.
Swaminathan J.
Tagari M.
Tate J.
Tromm S.
Velankar S.
Vranken W.
Publication venue: Oxford University Press
Publication date: 01/01/2003
Field of study

The E-MSD macromolecular structure relational database (http://www.ebi.ac.uk/msd) is designed to be a single access point for protein and nucleic acid structures and related information. The database is derived from Protein Data Bank (PDB) entries. Relational database technologies are used in a comprehensive cleaning procedure to ensure data uniformity across the whole archive. The search database contains an extensive set of derived properties, goodness-of-fit indicators, and links to other EBI databases including InterPro, GO, and SWISS-PROT, together with links to SCOP, CATH, PFAM and PROSITE. A generic search interface is available, coupled with a fast secondary structure domain search tool

CiteSeerX

PubMed Central

Crystal structure and mechanism of a bacterial fluorinating enzyme

Author: A Bateman
C Dong
C Schaffrath
C Schaffrath
Changjiang Dong
Christoph Schaffrath
D O'Hagan
D O'Hagan
D O'Hagan
David O'Hagan
DL Zechel
DMF van Aalten
Fanglu Huang
FH Allen
G Sandford
GN Murshudov
H Boutselakis
Hai Deng
HM Berman
J Dunitz
J Hutchinson
J Littlechild
J Mann
JAK Howard
James H. Naismith
Jonathan B. Spencer
KH vanPee
L Holm
M Sanada
OS Smart
PA Kollman
RJ Morris
S Bailey
S Doublie
SF Altschul
TC Terwilliger
X-H Xu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 05/02/2004
Field of study

Fluorine is the thirteenth most abundant element in the earth's crust, but fluoride concentrations in surface water are low and fluorinated metabolites are extremely rare. The fluoride ion is a potent nucleophile in its desolvated state, but is tightly hydrated in water and effectively inert. Low availability and a lack of chemical reactivity have largely excluded fluoride from biochemistry: in particular, fluorine's high redox potential precludes the haloperoxidase-type mechanism used in the metabolic incorporation of chloride and bromide ions. But fluorinated chemicals are growing in industrial importance, with applications in pharmaceuticals, agrochemicals and materials products. Reactive fluorination reagents requiring specialist process technologies are needed in industry and, although biological catalysts for these processes are highly sought after, only one enzyme that can convert fluoride to organic fluorine has been described. Streptomyces cattleya can form carbon-fluorine bonds and must therefore have evolved an enzyme able to overcome the chemical challenges of using aqueous fluoride. Here we report the sequence and three-dimensional structure of the first native fluorination enzyme, 5'-fluoro-5'-deoxyadenosine synthase, from this organism. Both substrate and products have been observed bound to the enzyme, enabling us to propose a nucleophilic substitution mechanism for this biological fluorination reaction

Crossref

University of East Anglia digital repository

University of St. Andrews - Pure

E-MSD: an integrated data resource for bioinformatics

Author: Barton G. J.
Boutselakis H.
Copeland J.
Dimitropoulos D.
Fillon J.
Golovin A.
Henrick K.
Hussain A.
Ionides J. M. C.
John M.
Keller P. A.
Krissinel E.
McNeil P.
Naim A.
Newman R.
Oldfield T. J.
Pajon A.
Pineda J.
Rachedi A.
Sitnov A.
Sobhany S.
Suarez-Uruena A.
Swaminathan G. J.
Tagari M.
Tate J. G.
Tromm S.
Velankar S.
Vranken W.
Publication venue: Oxford University Press
Publication date: 01/01/2004
Field of study

The Macromolecular Structure Database (MSD) group (http://www.ebi.ac.uk/msd/) continues to enhance the quality and consistency of macromolecular structure data in the Protein Data Bank (PDB) and to work towards the integration of various bioinformatics data resources. We have implemented a simple form-based interface that allows users to query the MSD directly. The MSD ‘atlas pages’ show all of the information in the MSD for a particular PDB entry. The group has designed new search interfaces aimed at specific areas of interest, such as the environment of ligands and the secondary structures of proteins. We have also implemented a novel search interface that begins to integrate separate MSD search services in a single graphical tool. We have worked closely with collaborators to build a new visualization tool that can present both structure and sequence data in a unified interface, and this data viewer is now used throughout the MSD services for the visualization and presentation of search results. Examples showcasing the functionality and power of these tools are available from tutorial webpages (http://www.ebi.ac.uk/msd-srv/docs/roadshow_tutorial/)

CiteSeerX

Crossref

PubMed Central

University of Dundee Online Publications